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Abstract 

£\| This paper presents a new class of 8 th order canonical polynomials in R". Based on the 

sequential canonical dual transformation, all extrema are obtained. The method can be used to 
' solve the associated 7 th order nonlinear algebraic equations. Optimality conditions are provided 

to find both global minimizers and local extrema. Applications are illustrated by examples. 
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1 Primal Problem and Motivation 

This paper intends to solve the following 8 th order polynomial minimization problem 



(V n ) min {P n (x) = U(x) - x T h| x G R"} , (1) 

where 

V(x) = \a 2 (y 2 (y 1 ^))) 2 +b 2V2 (y 1 {^))+c 2 , (2) 

2/2(2/1) = ^{yxf + b lVl + c u (3) 

2/i (x) = ia ||x|| 2 +x T bo + c , (4) 

. 2 

. and aQ,co,ai,bi,a (i = 1,2) are given constants, h,bo G R™ are given vectors. In this paper, we 

assume that > (i — 0, 1, 2) because {V n ) is a minimization problem. 

The global optimization problem (Vn) appears extensively in engineering physics and materials 
^ science: examples include phase transitions of solids [7], shape- memory alloys [12], and superconduc- 

tivity [9]. From a systems theory point of view, if the vector h e M™ is considered as a source (input) 
variable, say the external force in structural mechanics, then the linear term x T h can be viewed as the 
"external energy" and the polynomial U(x) as the "internal (or free) energy" . Clearly, if bo = € R n , 
then U(x) is the so-called objective function [4], i.e., V(Qx) = U(x) for all xeB" and all Q E R nxn 
such that Q^ 1 — Q T and detQ = 1- Moreover, if a 2 = &i = ci = and c = —A < 0, then U(x) is 
the well-known double-well potential 

W(x) = Qa ||x|| 2 - X^j , a = b 2 a x > 0, 

which was first proposed by van der Waals in thermodynamics and which appears in many branches 
of science. In theoretical physics, it provides one of the simplest models of the unified field theory. In 
ferroelectric and ferromagnetic systems, the critical phenomena and the phase transition are modeled 
by this double well potential [U[2j[T0]. It can also be found in the theory of dislocations in metals, the 
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theory of Josephson junctions, nonconvex dynamics, computational biology, and systems of network 
communication [5j[TTl[13] . Particularly, the graph of W(x) in R 2 is the so-called Mexican hat potential, 
which has been used to model the Higgs Mechanism in quantum mechanics and is also referred to 
as the sombrero potential in cosmic strings [3]. In order to model high-order phase transitions, a 
high order van der Waals-type polynomial has been proposed in [2J|B]. Because the polynomial V(x) 
proposed in this paper has more coefficients than the van der Waals-type polynomials, it can be used 
for modeling more complicated phenomena. However, due to its nonconvexity, the problem (V n ) could 
have multiple local minimizers, and finding the globally optimal solution by traditional direct methods 
is considered a fundamentally challenging task in global optimization. 

Canonical duality theory developed in [4] is a potentially useful methodology which can be used 
to solve a relatively large class of nonlinear algebraic and differential equations H] ■ The associated 
triality theory can be used to identify both local extrema and global minimizer (see [5]). The purpose 
of this paper is to generalize this result to solve the global optimization problem (V n )- In this next 
section, we first show that the n-dimensional nonconvex polynomial minimization problem can be 
reformulated by the canonical duality theory into a one-dimensional canonical dual problem, which 
can be solved easily. Therefore, an analytical solution to (V n ) is presented in Sections 2, 3, and 
4. We then discuss the special case of n = 1 in Section 5 and show that the one-dimensional 8 th - 
order polynomial optimization problem can be solved completely by the canonical duality theory to 
obtain all critical points. The extremality behaviors of all these critical solutions can be identified 
by a generalized triality theory. Applications are illustrated in Section 6. The paper ends with some 
conclusion remarks. 

2 -P n ( x ) an d Its Canonical Dual 

From the pattern of the 8 th order polynomial defined in (Jl]), we introduce a geometrical operator 

yi = A^x) = iao||x|| 2 + b T x + co, (5) 
so 2/2 (x) = ?7i(Ai(x)) and U\{y\) = \a\y\ + b\y\ + c\. The dual variable of y\ is defined as 

si = — =aiyi+bi. (6) 

dyi 

Thus (yi, S\) form a canonical duality pair 4 and the Legendre conjugate can be uniquely defined 

by 

E7J(si) = sta{j/isi — Ui(yi)\ y\ £ R} 



(si - 6i)s 



(s 1 -b } y + b 1 (s 1 -b 1 ) 

2ai ai 



Cl 



It is easy to verify that 



dyi dsi 
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We introduce a second geometrical operator 



and let 



2/2 = A 2 (yi) = 2 ai ^i + + Cl 



y(x) = [7 2 (A 2 (2/1 (x))), 



(8) 



where U 2 {y 2 ) = ^a 2 y| + 6 2 y 2 + c 2 . The duality relation between y 2 and s 2 is 

dU2(V2) , . 

S2 = — "j = a 2 J/2 + »2- 

^2/2 



(9) 



Also, (j/ 2 , s 2 ) form a canonical duality pair [4j and the Legendre conjugate U 2 can be uniquely defined 

by 



C/ 2 * (s 2 ) = sta{y 2 s 2 - U 2 (y 2 )\ 2/2 € 
( S2 - & 2 ) 2 



2a, 



C2- 



(10) 



In the same way as before, we can verify that 



dU 2 (y 2 ) dUZ{s 2 ) TJ . , , 

S 2 = J/2 = j C/ 2 (2/ 2 ) + f/ 2 *(s 2 ) = S22/2- 

aj/2 as 2 

Since we can replace V(x) = J7 2 (A 2 (j/i(x))) with A 2 (j/i(x))s 2 — t r 2 ( s 2), the generalized total 
complementary function [5] S : 1™ x K — s- K can be defined as 

S(x,s 2 ) = A 2 (yi(x))s 2 - C/ 2 *(s 2 ) - h T x. 

Similarly, A 2 (yi(x)) — y 2 (x) — t r i(A 1 (x)) can be replaced by A^x^! — U~i(si), so the generalized 
total complementary function S(x, s 2 ) can be reformulated as 



£(x,si,s 2 ) = [Ai(x)si - U^{ Sl )]s 2 - C/*(s 2 ) - h T x 



-a ||x|| 2 + b T x + c ) S! - C/*(si) 



By the criticality condition V x ^(x, sx, s 2 ) — 0, it is easy to obtain 

x = x(si,s 2 ) = — ( — b 



Thus, the canonical dual function P d 



s 2 -t/*(s 2 )-h T x. 



a \sxs 2 
can be defined as 



P d (si,s 2 ) = sta{S(x,si,s 2 )| xel") 



SlS 2 c 



2a n 



SlS 2 



-b 



- s 2 U*(si) - U 2 (s 2 ). 



(11) 

(12) 

(13) 
(14) 



Combining (j6|), ((8|), and (J9j) captures the relation between si and s 2 given si = aij/i + 6i and 
s 2 = a 2 y 2 + b 2 . Therefore we obtain 

2 



, , /I 
s 2 = s 2 (si) = a 2 I -a x I — — — 



« 2 



2ai 



0! h Ci 

ai 

«2 / 2 



2ai 



(15) 
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where 

2a\a 2 C\ + 2ai&2 — 02^1 



H 



3 



«2 

Q 2 I „1 



To eliminate S2, we let cr = si and s 2 = r(cr) = 5^(0' — H3) to convert the total complementary 
function H(x, si, s 2 ) to 

H(x,<r) = Ai(x)<tt(<7) - Ul{a)r{a) - C/ 2 *(r(a)) - h T x. (16) 

Also, (|T2"j) becomes 

x(tr) = x(a, r(a)) = - (-^— - b ) (17) 
and the canonical dual function P d (s±, S2) can be converted to 



P* (fT \ H + fl ^ g2 - g 3) 2 (^) 2 + gi) 
,P (cr) = #4 + r-2 I , 2 TTT 



a 2 a(a 2 - H 3 ) 



(18) 



where 



rr "lllhll 

-Hi = 



#2 = 
#4 = 



a Q 

2a aico + 2a 6i - ai||bo|| 2 
2a 

2a a 2 c 2 + 2a 2 b T h — a b 2 



2aoa 2 

/ ; . , n t /T - — I I , \ 

$(a) 2 = 2[oT(cT)Y(a~H 2 )=2 



a 2 a(a 2 - H 3 ) 



(a - H 2 ). 



2oi 

It can be seen from (|T5)) that P d (a) is asymptotic only when h ^ 0. By the criticality condition 

dP d {v) = a2(i^-H,){H l -^{af) = Q 
d<J [2ai(jr(cr)] 2 

we obtain the following canonical dual algebraic equation 

$(cr) 2 = 2[ar{a)} 2 (a-H 2 ) = H x . (20) 

3 Complete Solutions 

From the canonical duality theory [4], we have the following result: 

Theorem 1 Let a G M 6e a solution of the dual algebraic equation t20\) . Then, the vector x = x(ct) 
is a critical point o/P n (x) and 

P n {5i) = Z(5i,a)=P d {<T). 

Conversely, every critical point x of P n {n) can be put in form |J7| j for some solution ff£l(o 
again with 

P n (5L)=E(5t,a)=P d (a). 
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Proof. By the definition of the canonical dual function, we know that 

S(x,a)=P rf (a). 



Since a is a solution to $(o~) 2 = Hi, we can combine (fTT|) and (|20[) to get 

5- = aiyi + 6i, (21) 

where 

J/i = J/i(x). 

Plugging this into (jTTJ) gives the criticality condition 

(a 2 y 2 + fcaXaift + &i)(a<)X + b ) - h = 0, (22) 

where 

J/2 = 2/2 (yi), 

and thus x is a critical point of P„(x). Furthermore, 

P n (x) = S(x,0-) = P d (a) 

by construction. 

Conversely, if x is a critical point of P n (x), we know that (|22p is satisfied. Plugging in x = x(cr) 
and simplifying gives us (|20|) . and thus we know that <7 is a solution to the algebraic equation. Ad- 
ditionally, the condition x = x(cr) leads to H(x,ct) = P d (a), and the criticality condition combined 
with (HU) leads to {2TJ and thus P„(x) = 3(x, ct) = P d (o-). This completes the proof. □ 



Remark 1 From \19fl , it is clear that P d {a) has two extra critical points at ±y ^ . However, from 
Theorem 1, we know that only x = x(cr), where a solve I120\) , are critical points o/P„(x). Even though 
are critical points of P d {a), x(±y^^) may not be critical points o/P„(x). These critical 
points were first noticed in Jfjj] but their origin was unexplained. Additionally, it is also clear from 
\19\) that solutions a to H20\) are not necessarily critical points of P d (o~) when h = 0. When h ^ 0, 
all solutions to \20\) are also critical points of P d (o~). 

From this remark we know that the extra critical points a = ±y^- of P d (a) do not correspond 
to critical points of P„(x). Instead, it can be inferred from 

93 (x, a) a 2 (3cr 2 — H 3 ) f a — bi\ d[o-r(a)] f a — b{ 



yi (x) i =-_wi yi(x) i (23 ) 

and (fl6|) that these two critical points are caused by the introduction of the second dual variable s 2 
and its subsequent elimination by substitution, since the term ^"J^^ gives precisely the extra critical 
points ±yj We can define the term "corresponding critical point" to denote a solution a of the 
algebraic equation (|20l) : from Theorem [1] we know that all corresponding critical points a are part of 
an (x, a) pair such that x = x(cr) is a critical point of P(x) and P d (a) = P(x). 

We also know that every critical point x of P„(x) is part of an (x, a) pair such that the above 
conditions hold. 
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3.1 Complete Solutions for the case h = 

From (fT5|) it is clear that a is a critical point of P d (a) and (x, a) is a critical point of S(x, a) if h 7^ 0. 
If h = 0, then a is a critical point of P d (a) and (x, a) is a critical point of S(x, a) iff a = H 2 . The 
other solutions to (f^Uj) . a = and <7 2 = f/3, still correspond to critical points x of P(x) but are not 
themselves critical points of P d (a). Thus, we have: 

Theorem 2 Suppose that x is a critical point o/P„(x) and that h = 0. Then, either x is a solution 
to aij/i + b\ = and 



P„(x) = tf 4 + ^t, 



'1 



aij/i + b\ = H 2 and 



or (aij/i + &1) 2 = i/3 and 



P n (5L) = H i+ a ^- H3) 



8aj 



P n (x) = tf 4 . 

Furthermore, any x £/iai satisfies (aij/i + &1) 2 = is a global minimizer o/P„(x). 

Proof. From Theorem[TJ we know that P n (x) = S(x,<r) = P d (a), where a = aij/i + 61. Additionally, 
a is a solution to ([20]) . so we know that at least one of the following must be true: 

aivi+bi = 
aiyi +bi = H 2 
(axyi+bi) 2 = H 3 . 



Furthermore, 



P„(x)=P d (0) = // 4 + £ 



P n (x) = P d {H 2 ) = H 4 + 



8af 

P„(x) = P d (±^/lh) = Hi 

in each of those cases, respectively. Thus, it is clear that global minimizers x of P ra (x) satisfy the 
condition {a\y\ + &1) 2 = H 3 . Additionally, since H 2 < a, we also know that a = only if i? 2 < and 
that a 2 = H 3 only if H 2 < #3 . This completes the proof. □ 



4 Global Extremality Conditions 

In order to identify both global and local extrema among all critical points, sufficient conditions are 
fundamentally important in global optimization. It can be seen from (p?Tj) or (|!20|) that H 2 < so we 
can assert that a G S a , where 

S a = {ae R\ H 2 < a}. 



G 



Additionally, if wc define a region 

S+ = {a £ S a \$l(i/H~ 3 ) < a}, (24) 
where ^t(z) is the real part of complex number z, the dual problem is 

(P d ) : max{P d (a) | a £ S+} . (25) 
and we obtain the following theorem. 

Theorem 3 Suppose that a is a critical point of P d {a). If cr £ , then a is a global maximizer of 
P d (o~) on S£ , x = x(cr) is a global minimizer of P ra (x), and 

P„(x) = min P n (x) = max P d (a) = P d (a). 

Proof. On the region a £ <S+, < min{ ^"T^ ; cr, t(ct)}. Thus, we know from (|2"3"|) that 

P ra (x) = max S(x, cr), 

and we can see from (TIT))) that P d (a) is concave on cr G <S+, thus a is a global maximizer of P d (a) on 
cr G <S+ . It is clear that the total complementary function 

S(x,<t) = Ai(x)trr(a) - C/r(cr)r(cr) - U;(r(a)) - h T x 

is convex in x G K™, so we have 

P d (a) — max P d {u) = max min S(x, cr) 

crGS+ creSt X 6 R ™ 

= min max S(x, cr) = minP„(x) = P„(x). 
This completes the proof. □ 
From (fT9|) . it is clear that P d (a) only has a critical point on <S>+ at a solution to (|20|) . Since 

"a 2 cr(CT 2 - #3)" 



$(cr)^ = 2 



2ax 



(<r - H 2 ), 



it is equally clear that $(cr) 2 is monotonically increasing on and that $(3t(\/ P3)) 2 — 0. Thus, 
P d (o~) is concave on 5+ and, if h ^ 0, there is a guaranteed unique corresponding critical point cr on 
<S+. From Theorem [3J we know that if h = 0, there is no unique global minimizer of P n (x). Instead, 
all solutions to aiyi(x) + 61 = -h^/H^ are global minimizers of P n (x). 

5 One-Dimensional Problem 

In the case case that n = 1, both h and bo are given constants and (P„) is presented as an extremum 
problem 

(P) ext {P(x) = V(x) - hx\ x £ M} , (26) 
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the notation ext{*} stands for finding all extremum solutions. From Theorem [TJ and the canonical 
duality theory, we know that (P) corresponds to a dual problem 



(T d ) : ext{V d (a) | del}. 



(27) 



There is a natural dichotomy of problems of this type: the addition of the linear term hx in P(x) 
breaks the inherent symmetry and ensures one unique global minimizer of P{x). Theorem [5] can be 
used to solve (V) in the h = case. 

5.1 Local Extrema for h ^ 

In the case where h ^ 0, we have the following theorem. 

Theorem 4 Suppose that h ^ 0. The corresponding critical points a of P d (o~), if they exist, lie in 
the regions defined by 



where $t(z) is the real part of complex number z. Furthermore, each of the regions S~ , Si, and S 2 , if 
non-empty, contains exactly one critical point, a maximizer, o/$(ct) 2 . Finally, $(cr) 2 is convex and 
monotone increasing on S~^ ■ 

Proof. Because corresponding critical points a of P d {a) satisfy (f20|) . it is clear that a € iS~U<SiU<S>2U6> + 
when h 7^ 0, since Se5 a and 



Thus a 7^ H 2 and af ^ 0. 

Because &(a) 2 > and $(cr) 2 = at a = H 2 and at ar(a) = 0, it follows that each non-empty 
region S~ , Si , and 52 contains at least one maximizer of < &(ct) 2 . Thus, if it can be shown that S~ , Si , 
or £2 contains exactly one critical point of $(er) 2 , that critical point must be a maximizer of $(cr) 2 . 
Since 




)} 



Hi 



d[$(a) 2 ] 



2o~r(<7) ctt(ct) + 2(cr - H 2 ) 



d[ar(a)} 



da 



da 



—aT{a)(7a 3 - 6H 2 a 2 - 3H 3 a + 2H 2 H 3 ) 



—aT(a)Q(a) 7 
ai 



with 



dQ(a) 
da 



Q{a) = 7a 3 - 6H 2 a 2 - 3H 3 a + 2H 2 H 3 
Q'(a) = 3{7a 2 - AH 2 a - H 3 ), 
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it follows that critical points of &(<j) 2 occur in S a , Si, and 52 only when Q(a) = and that critical 
points of Q(a) occur at 

± _ 4H 2 ± y/!6Hj + 28H 3 2H 2 ± y/4Hj + 7H 3 
° ~ 14 = 7 ■ (28) 

If S~ £ 0, then H 2 < SR(— /HJ) < 0. When H 3 = 0, it is clear that $(ct) 2 has exactly one critical 
point on S~ at the point a = ^y 2 -. When H 3 > 0, it is equally clear that S\ and 52 are non-empty, 
and S~ , Si and S2 each contains at least one maximizer of &(<j) 2 since we assume that S~ 7^ 0. 
Because Q{<r) has at most three roots, it follows that S~ , Si, and S2 each have exactly one critical 
point, a local maximizer, of &{<j) 2 ■ When H 3 < 0, we know from (|28|) that ^y 2 - < cr ± < and, since 
the expressions a 2 (7cr — 6H 2 ) and H 3 (2H 2 — 3a) are both positive on this range of values, we know 
that (ct ± ) 2 (7ct ± - 6H 2 ) + H 3 (2H 2 - 3ct ± ) = Q(cr ± ) > 0. Consequently, $(cr) 2 will have exactly one 
critical point on S~ if 

Q(H 2 )Q (»(-V^)) = Q(H 2 )Q(0) < 0. 
Calculating Q(a) at these points gives 

Q(H 2 ) = H 2 (H 2 - H 3 ) < 
Q{0) = 2H 2 H 3 > 0, 

so S~ contains exactly one critical point of $(cr) 2 when S~ 7^ 0. 
Similarly, when Si 7^ 0, 

#2 < 

#3 > 

Si = {a e R\ ■max{H 2 ,-^/H 3 '} < a < 0}. 

Since it has already been shown that S~, Si, and S2 each contains exactly one critical point of $(ct) 2 
when none of them are empty, and since S2 7^ when Si 7^ 0, we can focus on the case where S~ = 0. 
This implies that —y/Hj, < H 2 . From ([28]) . we know that cr + > <^ Si, so 

Q(H 2 )Q(0) < 

is a sufficient condition to prove that &(<r) 2 has exactly one critical point on Si. Since 

Q(H 2 ) = H 2 (H 2 - H 3 ) > 
Q{0) = 2H 2 H 3 < 0, 

it only remains to be shown that $(<r) 2 has only one critical point on Si when Q(H 2 ) = 0. In this 
case, 

Q'0)| CT =H 2 =6iff >0, 

so we know that Q(u) > for some a € Si. Because Q{u) is continuous and has at most one critical 
point on Si, it follows that <&(cr) 2 has only one critical point on Si when Si 7^ 0. 
Thirdly, when S 2 7^ 0, 

H 3 > 
H 2 < \f~Eh, 

S 2 = {a e R| max{£T 2 , 0} < a < y/lh}. 
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When H 2 < 0, it is clear from ([25)1 that Q(a) has at most one critical point on S2, so 

Q(0)Q(VH~ 3 ) < 

is a sufficient condition to prove that 3>(cr) 2 has exactly one critical point on 6>2- Since 

Q{0) = 2H 2 H 3 < 
Q{^Ih)=AH 3 {^h-H 2 ) > 

and Q'(cr)| CT= o = —3H 3 < 0, there must be exactly one critical point of §(cr) 2 on S 2 when H 2 < 0. 
When H 2 > 0, there is at most one critical point of Q{a) in S 2 since H 3 > and o~ < 0. Thus, it is 
sufficient that 

Q{H 2 )Q{^%) < 
holds in order for $(ct) 2 to have exactly one critical point on S 2 . Because 

Q(H 2 ) = H 2 (H 2 - H 3 ) < 
Q(V#3) = 4H 3 (y/lh - H 2 ) > 0, 

$(a) 2 has exactly one critical point on S 2 when S 2 7^ 0. 

Finally, $(ct) 2 is convex on S+ because a > 0, r(cr) > 0, d[a ^ a)] > 0, > , and H 2 < a 

for a G <S+. Thus, all terms in 



da 2 



2 M ^ +2( ,_ i?2) 



are positive. By the same reasoning, 1 > and $(cr) 2 is also monotone increasing on 5+. This 

completes the proof. 



<jt{ct) 



d 2 [ar{a)} f d[ar(a)} 



da 2 



da 



We can label the maximizers of $(er) 2 on S a ,Si, and £2 as a b , cr 11 , and er', respectively, and define 

5~ + = {aeS-\a<a b } 

S~_ = {aeS-\a>a b } 

S 1+ = {aeS-\a<a*} 

51- = {aeS^\a>a ii } 
S 2+ = {a G S+\ a < a*} 

5 2 - = {a G S+\ a > J}. 

From Theorem |4l we know that ^£ig) > on S~ + , <Si+, and 6>2+ and that < on S^L, <Si-, 

and ^2- . If we set R(<j) to the expression obtained when (fT5)l and (|17l) are substituted into d 

R(a) = I , = 1 f 4- - &0 H = - 6^ 2 - 3H 3 a + 2F aJ ff.) = ^Q(a), 

^ dx^ ao \<tt(ct) / J zai 2ai 

it is clear that Q(a)R(a) > when Q(a) 7^ and, since R(a) — d ^Jf^ , that Q(d')^ L -^t^- > when 
Q(o-) 7^ as well. When Q(a) = 0, then = 0. Thus, we have the following theorem. 
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Theorem 5 Suppose that h =/= 0. Then, each of the regions 5~_j_, 5;± (i = 1,2) contains at most 
one corresponding critical point a of P d (a), 5 + contains exactly one corresponding critical point of 
P d (a), and a G S~± Hi G S~-r., a G Si± Q G Si T , where Q is another corresponding critical 
point of P d (o~). Finally, a G S~< U 5i_ U £2+ if and only if x — x(a) is a local maximizer of P(x), 
a G 5~_ U S\+ U 52- U 5+ if and only if x is a local minimizer of P(x), and a G a 9 U U <r" if and 
only if x is an inflection point of P(x). 

Proof. Since we know that > on 5 a+ U5i+U52+U5 + , and that < on 5~_ U5i_ U52- 

from Theorem [H we also know that each of these regions can have no more than one value at which 
<&(er) 2 = Hi because < &(cr) 2 is continuous over a G K. Since 

x{a) = — [ k - b 
a \o-T(a) 

is a critical point of P(x) if and only if ^(cr) 2 = Hi, it follows that there can be at most one a on each 
region. Also, since ^{^/Hj,) 2 = < H\ and $((t) 2 is monotonically increasing on 5+, there is exactly 
one <7 G 5 + . Because <I>(i/2) 2 = < Hi, cr b > Hi iff 5 f 7+ contains one corresponding critical point. 
Since <&(— y/H^) 2 = < Hi, S~_ contains a corresponding critical point iff tr b > Hi. Therefore, 5~ + 
contains a corresponding critical point <r iff S~_ does as well. Because <&(±vfls) = $(0) = < 
the same logic ensures that 5^+ (i = 1,2) contains a corresponding critical point iff Si- does. 

On 5~ and 52, err (a) < so Q(ct) < on 5^_ U 52+ and Q(ct) > on 5~_ U 52-. By the same 
logic, Q(a) > on 5 1+ U 5+ and Q(ct) < on 5i_. Thus, 

5- + U 5i_ U 5 2 + = {a G 5 a | Q(tr) < 0,ar(a) ^ 0} 
S~_ U 5i+ U 5 2 _ U 5+ = {a G 5 a | Q(<r) > 0, ctt(ct) ^ 0}. 



Consequently, 



Since Hi ^ 0, 



thus 



d 2 P(x) 

3 ' < a G 5~ + U 5i_ U 5 2 + 

d 2 P(x) 

v 2 ' > <S> ctg5-_U5i + U5 2 _U5+. 

cr b U cr 15 U cr» = {a G 5" U 5i U 5 2 U 5+| Q(cr) = 0}, 
d 2 P(x) b h „ 



cfa: 2 

This completes the proof. □ 



5.2 Classification of critical points 

In addition to knowing where critical points x occur, it would also be useful to know how many 
critical points P(x) has based on the particular values of the coefficients. In the case where h = 0, 
this amounts to comparing iJ 2 , H3 and 0: P(x) has seven critical points when H2 < \/Hz) < 0, 
five critical points when 3?(— \/Hz) < H2 < 0, three critical points when < Hi < ^(v^Ba) or 
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H.2 < = 5ft(-v/i?3), and one critical point when < ^.(^/H^) < H^- In other words, P(x) has seven 
critical points when S~ , S%, S2, and 5+ are all non-empty, five critical points when only S~ is empty, 
three critical points when either S~ or 1S2 is the only non-empty region apart from S+, and only one 
critical point when S~ , S±, and S2 are all empty. 

In the case where h 7^ 0, the number of critical points of P(x) also depends on the value of H\: if 
all other coefficients stay constant, P(x) has a maximum number of critical points when Hi = h = 
and only loses critical points as H± increases in value. 

6 Applications 



6.1 Case n=l 

Letting a, = 1 (i = 0, 1, 2), 60 = 3, 61 = 2, 62 = 1, Co = —1.5, c\ = —1, C2 = —5, and h = 2, we have 
P(x) 



1 



3 .7 , 55 « , 117 , 



851 



G9 



128 16 32 
The graph of P(x) is shown in Fig. [TJ 



16 



64 



16 



249 2 77 479 
-a; x - 



32 



16 



128 




Figure 1: Graph of P(x) 



The canonical dual of P(x) has the form 



pd(ff)= l + (a 2 -4) 2 a\^-Af{a + A) + , 



2 8 2cr(cr 2 - 4) 

and the constants Hi, H2, and H3 have the values 

ff x = ^ = 4 

2a aic + 2a bi - aib^ 

£12 = = —4 

2a 

2aia 2 ci + 2aib 2 - a 2 b\ 

H3 = = 4. 

a 2 
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Figure 2: Graphs of P d {a), $(ct), and s/W x 



Its graph, in blue, is shown in Fig. [2] along with $(cr) = ±ar(a)y^2(a — H2) in red and \fiT[ = h^J 2± 
in black. Corresponding critical points a are circled. Since 

Hi = 4 < <$>{o^) 2 < $((T S ) 2 < $(cr b ) 2 , 

there are seven corresponding critical points a of P d (a) and seven critical points x of 




Figure 3: Graphs of x{a) (red) and <j{x) = a\yi(x) + 61 (blue) with x as the abscissa and a as the 
ordinate. 



It can be seen from Fig. [3] that the (x,a) pairs are, in descending value of a, 



(xi,CTi) = 


(.05014,2.1299), 


(x 2 ,a- 2 ) = 


(-6.4157,1.8334), 


(S3, ^3) = 


(-5.9495,0.3497), 


(£4,04) = 


(-0.3117,-0.3864), 


(25,05) = 


(-0.8573,-1.7043), 


(x 6 ,a 6 ) = 


(-4.8837,-2.2258), 


(27,0V) = 


(-3.0836,-3.9965). 
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Figure 4: Graphs of P(x) (blue) and P d (a) (red) for H x < min{$(cr b ) 2 , $(cr^) 2 , $(cr«) 2 } 

If we overlay P(x) and P d (a), as in Fig. [4j we can clearly see that x\ is the global minimizer of 
P(x) on i £ R and that d\ is the global maximizer of P d (a) on a € S£ ■ Thus, x\ is the solution to 
(Vi) and u\ is the solution to (V d ) over cr € <S+. Also, it can be seen both from Figs. [3] and |4] that 
P(x) = P d (a) for every (x, a) since Fig. [3]traces out the regions in the xcr-plane where E(x, cr) = P(x) 
and H(x, cr) = P d (a) in blue and red, respectively. 




-6 -4 -2 2 -6 -4-2 



Figure 5: Clockwise from top left, P(x) and P d {a) for ^/W x = h= \$(a*)\ = 3.6978, /i = |$(cr»)| = 
4.9535, fi = |$(cr b )| = 14.4859, and h = 20 

Since the critical point of $((t) 2 on 5* (i = 1,2,3) occurs at the same value of a that maximizes 
|3>(cr)| on Sq, that is, maximums of |$(<r)| on (i — 1,2,3) occur at a 9 , a^, and cr" respectively, we 
can see from Fig. [5] that a € S l a corresponds to an inflection point of P(x) when |$(cr)| > |$(cr)| for 
all a £ S l a and that cr ^ S l a if 

max |$(ct)| < \[~H\. 
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As always, a G corresponds to the global minimizer of P(x). 

When h — 0, the overlay of P(x) in blue and P d (a) in red is shown by Fig. [5] It can be seen that 
even though the circled points on P d (a) corresponding to solutions of ^{c) 2 — Hi are not critical 
points of P d (a), they still correspond to critical points of P(x). 




Figure 6: P(x) and P d (s) for h = 



6.2 Case n=2 

If we set di = 1 (i = 0,1,2), bo 
h = y/2[l 1] T , we get 

H x -- 
H 2 - 

Hz - 



[3 0] T , bi — 2, b 2 — 1, c = —1.5, c\ = — 1, c 2 = — 1, and 



arllhll 



a 

2a aic + 2a 6i - a-i || bp | 
2a 

2a\a 2 c\ + 2ai&2 — i2&i 
a 2 



= 4. 



as before; the canonical dual P d (a) also has the same form, and is shown in Fig. [2] A contour plot 
of P2(x) is shown in Fig. [71 Critical points are crossed. 

The corresponding critical points a are the same as in the previous example, but the (x, a) pairs 
are now, in descending value of a, 

(x!,cti) = ([-0.525, 2.475] T , 2.1299), 

(x 2 ,5- 2 ) = ([-5.416, -2.416] T , 1.8334), 

(xa,CT 3 ) = ([-5.086, -2.086] T , 0.3497), 

(x 4 ,o- 4 ) = ([-1.099, 1.901] T , -0.3864), 

(x 5 ,a 5 ) = ([-1.485, 1.515] T , -1.7043), 

(x 6 ,ct 6 ) = ([-4.332, -1.332] T , -2.2258), 

(x 7 ,ct 7 ) = ([-3.059, -0.059] T , -3.9965). 
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Figure 7: Contour of P2(x). Warmer colors correspond to larger values of P(x). 



From Figs. [2] and [71 we can see that x x is the global minimizer of -P2( x ) on x e E 2 and that <7i is 
the global maximizer of P d (a) on a € <S+. Thus, Xi is the solution to (P2) and cti is the solution to 
{V d ) over cr eS+. 

7 Conclusions 

In this paper we obtain the exact solution to the 8th order polynomial minimization problem, which 
was seldom discussed before. One advantage of using this way to find the critical points of P n (x) is 
that the largest corresponding critical point of P d (a) corresponds to the global minimizer of P n (x). 
Furthermore, we can see the efficiency of the canonical dual transformation method: it provides pow- 
erful primal-dual alternate approaches with zero duality gap and can reduce the number of dimensions 
in nonlinear programming. 
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